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Executive  Summary 


This  final  report  summarizes  the  findings  of  the  research,  “Segmentation  of  Multi-look,  Multi¬ 
frequency  and  Multi-polarimetric  SAR  data”  supported  by  the  AFOSR  contract  F49620-92-J0130. 
During  the  duration  of  the  project,  we  have  developed  algorithms  for  a)  Markov  Random  Field 
based  segmentation  of  high  resolution  SAR  images,  b)  detection  of  man-made  features  in  SAR 
images  and  c)  labeling,  as  well  as,  grouping  algorithms.  These  algorithms  have  been  integrated  to 
produce  a  2-D  site  model  of  the  given  SAR  image.  The  2-D  site  model  is  an  annotated  description 
of  the  SAR  image  incorporating  natural  and  man-made  features  such  as  trees,  grass,  water,  open 
terrain,  buildings,  roads  and  shadows.  Such  site  models  are  useful  for  delineating  regions  of  interest 
(which  serve  as  focus  of  attention  mechanisms)  and  for  providing  local  context  in  ATR  algorithms. 

The  2-D  site  models  wiU  be  useful  for  the  urban  clutter  mitigation  problem  in  the  ATR  domain. 
Target  detection  in  SAR  images  of  urban  areas,  with  a  low  occurrence  of  false  alarms,  is  a  difficult 
problem  because  of  the  presence  of  a  large  number  of  spurious  bright  returns.  These  arise  mainly 
due  to  the  cardinal  effects  caused  by  the  vertical  waUs/roofs  of  buildings,  as  weU  as  due  to  other 
cultural  objects  which  produce  bright  backscatter.  Several  of  these  false  targets  can  be  eliminated 
by  the  use  of  context-based  arguments  once  we  are  able  to  segment  and  label  the  scene.  Moreover, 
target  detection  algorithms  can  be  cued  to  look  on  roads  and  clearings,  once  these  have  been 
delineated  in  the  SAR  image,  resulting  in  computational  savings.  We  have  begun  interactions 
with  Dr.  Les  Novak  of  MIT  Lincoln  Laboratory  on  integrating  the  2-D  site  models  in  their  ATR 
algorithm.  Another  potential  application  of  our  work  is  in  SAR  seeker.  AU  of  our  algorithms 
have  been  tested  on  the  1'  x  1'  high  resolution  fuUy  polarimetric  SAR  data  collected  at  Lincoln 
Laboratory.  We  acknowledge  the  assistance  of  Dr.  Les  Novak  in  providing  this  data. 

The  research  contract  partially  supported  the  Ph.D.  thesis  work  of  Mr.  Shyam  Kuttikkad.  He  is 
expected  to  graduate  in  December  1996. 

Publications  resulting  from  this  contract  are: 


1.  E.  Rignot  and  R.  CheUappa,  “  Segmentation  of  Polarimetric  Synthetic  Aperture  Radar  Data”, 
IEEE  Trans.  Image  Processing,  Vol.  IP-1,  pp.  281-300,  July  1992. 

2.  E.  Rignot  and  R.  CheUappa,  “Maximum  a  posteriori  Classification  of  Multifrequency,  Multi¬ 
look  Synthetic  Aperture  Radar  Intensity  Data”,  Jl.  Optical  Society  of  America,  A,  Vol.  10, 
pp.  573-582,  April  1993. 

3.  S.  Kuttikkad  and  R.  CheUappa,  “Non-Gaussian  CFAR  Techniques  for  Target  Detection  in 
High  Resolution  SAR  Images”,  Proc.  First  Inti.  Conf.  on  Image  Processing,  Austin,  TX,  pp. 
910-914,  Nov.  1994. 

4.  R.  CheUappa,  “Image  Understanding  and  Statistics:  What  each  can  do  for  the  other?”  (In¬ 
vited  paper).  Joint  Statisticians  Meeting,  Toronto,  August  1994. 

5.  S.  Kuttikkad  and  R.  CheUappa,  “Building  Wide  area  2-D  Site  Models  from  High  Resolution 
SAR  Images”,  Inti.  Symposium  on  Computer  Vision,  Coral  Gables,  FL.  Nov.  1995. 
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2  Polarimetric  Feature  Vector  Based  Segmentation 


In  [5],  Derin  et.  al.  have  presented  stochastic  models  for  single-  and  multi-look,  intensity  and 
complex,  single  polarization  data,  and  developed  maximum  a  posteriori  segmentation  algorithms 
for  each  case.  Single  polarization  segmentation  schemes  do  not  completely  account  for  the  influence 
of  targets  and  terrain  on  the  polarization  of  the  electromagnetic  signal.  In  order  to  exploit  aU  the 
polarization  information,  fuUy  polarimetric  methods  have  to  be  used.  FuUy  polarimetric  radars 
permit  the  measurement  of  the  complete  polarimetric  scattering  matrix  of  each  resolution  cell  in 
the  region  being  imaged.  One  common  technique  for  the  segmentation  of  fuUy  polarimetric  data 
involves  the  statistical  modeling  of  the  polarimetric  measurement  vector,  followed  by  an  estimation 
of  the  region  labels. 

The  simplest  model  for  the  polarimetric  complex  data  is  the  multivariate  circular  Gaussian  distri¬ 
bution,  which  was  considered  in  [12,  19].  This  model  allows  for  the  easy  specification  of  marginal 
densities  and  characterization  of  the  statistical  properties  of  the  data  in  terms  of  first  and  sec¬ 
ond  order  moments,  while  providing  simple  closed  form  expressions  for  various  estimates  of  the 
region  labels.  In  [12]  a  maximum  likelihood  (ML)  estimate  was  obtained  for  the  region  labels. 
A  Markov  Random  Field  (MRF)  model  was  proposed  in  [19]  to  obtain  a  maximum  a  posteriori 
(MAP)  estimate  of  the  region  labels,  leading  to  a  smoother  segmentation. 

Let  Xg  =  [I!  ff,  jffV,VV]s  denote  the  single  look  polarimetric  measurement  vector  at  site  s.  The 
VE  polarization  term  is  not  present  because  it  is  symmetrized  with  respect  .to  EV  during  data 
calibration.  Each  component  of  Xg  is  a  complex  number  of  the  form  EV  —  \EV\  exp{i(^^fv}.  The 
conditional  distribution  of  the  polarimetric  measurement  vector  Xg,  given  it’s  region  label  is 
assumed  to  be  circular  Gaussian  : 

V^Xg/Lg  =  /)  =  ^exp{-X:Cr'^Xj  -  log\Ci\]  (1) 

where  Ls{~  I,  /  G  {1, 2, . . .,  A^}),  is  the  set  of  region  labels.  Superscripts  *  and  T  denote  conjugation 
and  transposition  respectively.  Ci  =<  X*^X  >i  is  the  3x3  polarimetric  covariance  matrix,  with 
<  >;  denoting  ensemble  averaging  over  the  region  1.  The  model  of  (1)  assumes  that  each  region  has 
stationary  backscatter  statistics. 


2.1  Maximum  Likelihood  Estimation  of  Region  Labels 


For  azimuthaUy  symmetric  targets,  which  is  the  case  of  most  natural  targets,  the  cross-polarized 
return  is  uncorrelated  with  the  two  co-polarized  returns,  giving  rise  to  the  covariance  matrix 


where 
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Cl  can  be  inverted  analytically  and  substituted  in  (1)  to  give 
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Regions  of  the  radar  image  can  be  completely  characterized  by  the  parameters  of  the  covariance 
matrix.  These  parameters  can  be  selected  using  a  supervised,  semi-supervised,  or  unsupervised 
method.  A  clustering  routine  on  an  appropriately  selected  feature  vector  can  be  used  for  the 
unsupervised  selection  of  each  region’s  parameters.  The  problems  with  this  scheme  are  the  com¬ 
putational  cost  of  the  clustering  technique,  performance  variation  across  images,  and  the  need  to 
fine-tune  the  cluster  parameters.  In  the  semi-supervised  scheme,  the  operator  specifies  the  possible 
number  of  cluster  centers.  Thereafter,  an  iterative  algorithm,  which  alternately  sphts  the  training 
data  into  the  specified  number  of  classes  and  then  computes  their  cluster  centers,  is  used  to  find  the 
best  possible  cluster  centers  from  the  training  data.  Both  these  methods  suffer  from  the  drawback 
that  the  resulting  cluster  centers  do  not  necessarily  represent  the  polarimetric  characteristics  of 
actual  regions  like  vegetation,  roads,  shadows,  etc.  The  supervised  method  requires  training  areas 
and  some  knowledge  of  the  ground  truth.  Although  the  segmentation  results  are  sensitive  to  the 
selected  training  areas,  this  method  is  not  very  restrictive  since,  during  a  particular  data  collection 
run,  we  can  assume  that  the  characteristics  of  the  various  classes  are  reasonably  stationary.  The 
covariance  matrix  parameters  for  each  region  are  estimated  by  obtaining  their  ML  estimate  over  a 
set  of  data  points  which  are  known  to  belong  to  that  region. 

It  is  assumed  that  as  we  move  a  shding  window  of  size  Ng  across  the  entire  image,  the  region 
label  within  the  window  remains  the  same.  The  joint  distribution  of  the  polarimetric  vectors  of 
a  small  neighborhood  set,  Ng,  oi  s  is  considered,  to  improve  classification  accuracy.  The  spatial 
correlation  between  the  polarimetric  vectors  of  Ng  is  neglected  at  the  cost  of  a  slight  reduction 
in  segmentation  accuracy,  but  with  a  significant  saving  in  computational  cost.  The  interaction 
between  neighboring  pixels  is  taken  into  account  by  the  Markov  Random  Field  model.  Under  the 
uncorrelated  assumption,  the  conditional  distribution  of  the  entire  polarimetric  data  array  X,  given 
the  label  array  L  is  : 

p{XlL)  =  ^p{XglLg)  (5) 
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and  the  conditional  distribution  of  the  measurement  vector  at  each  pixel  is 

piXgjLg  =  /)  =  n  viXilk  =  1)  (6) 


Using  a  Gibbs  representation  of  (6),  we  have 


p{XglLg  =  /)  = 


^xp{-NU{{XglLg  =  /)} 


w'here  the  energy  function  Uf  is 
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The  ML  approach  to  region  labehng  involves  maximization  of  the  LHS  of  equation  (7).  This  is 
equivalent  to  assigning  that  label  to  a  particular  pixel  which  minimizes  the  energy  function  of 
equation  (8).  Since  the  ML  approach  assigns  equal  prior  probability  to  each  region,  the  result  may 
appear  noisy  and  pixels  in  the  same  region  may  be  left  unconnected.  The  resulting  segmentation 
can  be  made  smoother  by  incorporating  spatial  interactions  into  this  model. 


2.2  Maximum  A  Posteriori  Estimation  of  Region  Labels 

The  MRF  model  is  used  to  describe  the  spatial  interaction  between  neighboring  pixels.  According 
to  this  model,  the  conditional  distribution  of  the  region  label  Lg,  given  the  region  labels  elsewhere, 
depends  only  on  the  labels  of  an  immediate  neighborhood,  and  is  given  by  : 

p{Ls  =  e  A°)  =  ^  exp{-iV{7|(i,  =  €  iV°)}  (10) 

where  the  Gibbs  energy  function  U2  is  : 

UiiLs  =  l/Lr,re  N^)  =  -4  E  ^k{Ls  -  X.)  (11) 

/?  is  a  positive  constant  which  was  empirically  chosen  between  [1.0-1. 6],  [19],  independent  of  the 
data  set,  for  good  segmentation  accuracy.  (3  =  0  corresponds  to  the  ML  estimate  and  a  large  /3 
leads  to  excessive  smoothing.  is  a  neighborhood  of  s,  of  N  elements,  excluding  s  and  6k  is  the 
Kronecker  delta.  Z2  is  a  positive  normalizing  constant,  independent  of  1.  iV°  is  chosen  to  be  the 
same  as  Ng  for  consistency. 

The  posterior  distribution  of  region  label  Lg  given  the  single  look  polarimetric  vector  Xg  and  the 
region  labels  Lr  of  the  neighborhood  is  (since  Xg  and  are  independent)  : 


p{Lg  =  l/Lr;r£N°,Xg)  = 


p{Lg  =  l/Lr-,r€N°)p{Xg/Lg  =  l) 


P{Xs) 


Since  Xg  is  known,  p{Xg)  is  just  a  constant  and  ; 


piLg  =  l/Lr-,re  N°,Xg)  oc  exp{-NUt{Xg/Lg  =  1)  -  NUi{Lg  =  e  A°)}  (13) 

The  posterior  distribution  of  the  entire  label  array,  given  the  entire  polarimetric  data  array  is  : 

p{L/X)  (X  exp{-iV  J2[Ut{Xg/Lg  =  l)  +  U^{Lg  =  //i,;  r  £  A°)]}  (14) 


The  MAP  estimate  corresponds  to  the  minimization  of  the  MAP  energy  function  : 

Emap  =  Y.\Pi^^s/Lg  =  0  +  U^{Lg  =  IIW,  r  G  N°)] 
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This  is  a  non- convex  function  which  can  be  minimized  by  using  the  simulated  annealing  [8]  algo¬ 
rithm.  Other  approximate  techniques  such  as  iterated  conditional  mode  [2]  and  maximum  posterior 
marginal  [15]  can  also  be  used.  We  have  used  the  iterated  conditional  mode  technique  [14]. 

Figures  1  (a)  and  (b)  show  two  SAR  images  with  cultural  features  like  buildings,  roads,  fields,  etc. 
The  result  of  ML  and  MAP  segmentation  are  shown  in  Figures  2  and  3.  The  images  in  (a)  have 
been  segmented  into  4  regions,  while  the  ones  in  (b)  have  been  segmented  into  5  regions. 


3  CFAR  Processing  for  Man-made  Feature  Detection 


Target  detection  in  SAR  images  can  be  done  by  comparing  the  received  backscatter  intensity  to  a 
fixed  threshold,  if  only  thermal  noise  is  present.  But  if  the  target  is  embedded  in  spatially  varying 
background  clutter,  as  is  common  in  SAR  terrain  imagery,  an  adaptive  threshold  has  to  be  used  to 
keep  the  probability  of  false  alarm,  a  constant.  CFAR  techniques  are  useful  for  detecting  metallic 
objects  and  other  strong  reflectors  like  vehicles,  power-line  towers,  metal  guard  rails  on  bridges, 
etc.  The  adaptive  threshold,  with  which  the  output  of  each  pixel  is  compared,  is  generated  using 
a  weighted  combination  of  the  outputs  obtained  from  a  moving  window  of  reference  cells  from 
the  background.  The  background  clutter  is  assumed  to  have  an  underlying  statistical  distribution. 
The  reference  cells  are  used  to  obtain  estimates  of  the  parameters  of  the  underlying  statistical 
distribution  of  the  clutter. 

The  traditional  circular  Gaussian  assumption  for  complex  backscatter,  which  results  in  a  Rayleigh 
magnitude  distribution,  is  not  a  good  fit  for  data  obtained  at  low  grazing  angles  and  from  high 
resolution  radars  [23].  Moreover,  in  order  to  ensure  that  the  presence  of  an  extended  target  does 
not  affect  the  threshold  computation,  the  reference  cells  must  be  spatially  well  separated  from 
the  test  cell.  This  tends  to  make  the  empirical  distribution  of  the  reference  cells  more  spiky  than 
predicted  by  the  Rayleigh  magnitude  distribution.  Distributions  with  larger  tails  and  larger  stan¬ 
dard  deviation-to-mean  ratio,  than  Rayleigh,  seem  to  match  the  empirical  distribution  better.  The 
Lognormal,  WeibuU  and  K-distributions  satisfy  these  conditions,  but  the  Lognormal  distribution 
tends  to  overestimate  the  range  of  variation  obtained  from  real  clutter.  Thus,  the  WeibuU  and 
K-distributions  are  of  interest  for  clutter  modeling,  because  they  lie  between  the  two  extremes  of 
Rayleigh  and  Lognormal,  and  they  include  the  Rayleigh  distribution  as  a  special  case. 

Several  types  of  CFAR  techniques  have  been  suggested  in  the  Uterature  [7,  10,  26,  20,  25,  18],  based 
on  the  method  of  obtaining  the  adaptive  threshold  from  the  reference  window.  The  CeU  Averaged 
CFAR  (CA  CFAR)  is  the  simplest  of  these,  where  aU  ceUs  in  the  reference  window  are  used  to 
compute  the  clutter  parameters,  and  hence,  the  adaptive  threshold.  The  CA  CFAR  technique 
assumes  that  the  ceU  under  test  is  the  only  possible  target  while  the  reference  ceUs  are  purely 
clutter.  This  detector  does  not  perform  very  weU  when  the  reference  window  overlaps  with  parts 
of  the  same  extended  target  or  other  targets,  interfering  with  the  computation  of  the  threshold.  In 
an  Order  Statistic  CFAR  (OS  CFAR)  detector,  an  order  statistic  of  the  reference  ceU  outputs  is 
used  to  compute  the  threshold.  The  OS  CFAR  technique  works  better  in  extended/multiple  target 
situations  because  a  suitably  chosen  order  statistic  is  more  robust  in  the  presence  of  additional 
targets.  Other  CFAR  techniques,  such  as  Greatest  of  CFAR  and  Censored  CFAR,  do  not  offer 
significant  advantages  over  the  OS  CFAR  technique  for  the  types  of  images  we  are  using.  In  [16]  and 
[17],  polarimetric  data  has  been  combined  into  a  single  channel  prior  to  applying  a  two-parameter 
CFAR,  but  we  restrict  our  attention  here  to  single  polarization  CFAR  processors. 
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Figure  1:  Original  SAR  image  with  buildings  and  roads 


Figure  2:  Result  of  ML  region  labeling 


Figure  3:  Result  of  MAP  region  labeling 
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3.1  CFAR  Detection  in  Non-Gaussian  Clutter 


Under  the  assumption  that  the  radar  backscatter  is  the  result  of  the  superposition  of  several  inde¬ 
pendent,  identically  distributed  scatterers,  the  in-phase  and  quadrature  components  of  the  clutter 
returns  become  jointly  Gaussian  processes,  as  a  consequence  of  the  Central  Limit  Theorem  [9]. 
The  resulting  backscatter  magnitude  (intensity)  is  Rayleigh  (exponentially)  distributed.  Several 
CFAR  processors  for  this  distribution  have  been  derived  in  the  literature  [7,  20].  As  mentioned 
before,  empirical  clutter  magnitude  distributions  for  high  resolution  radars  lie  between  Rayleigh 
and  log-normal  in  terms  of  spikiness.  The  WeibuU  and  K  distributions  satisfy  this  requirement  and 
also  include  the  Rayleigh  magnitude  distribution  as  a  special  case. 

In  a  CFAR  detector,  the  voltage  in  the  ceU  under  test  is  compared  with  an  adaptive  threshold 
derived  from  a  moving  window  of  reference  cells  from  the  background.  CFAR  processing  can  be 
posed  as  a  binary  hypothesis  testing  problem  : 

Hq  :  Target  absent  (clutter  only) 

H\  :  Target  present 

with  the  decision  criterion 

TTo  <  TZ  (16) 

Ho 

where  ttq  is  the  detector  output  for  the  ceU  under  test,  ^  is  the  test  statistic  derived  from  the 
outputs  of  the  M  ceUs  in  the  reference  window,  and  r  is  the  adaptive  threshold  multiplier. 

In  the  general  case,  given  a  test  statistic  Z,  with  pdf  fz{z),  the  probabiUty  of  false  alarm,  Pfa^ 
and  the  threshold  multiplier,  r,  are  related  by 

roo 

Pfa  =  /  Prob^o  >  Tz]fziz)dz  (17) 

Jo 

where  ttq  is  the  detector  output  for  ceU  under  test,  and  Pro[.]  is  the  probabiUty  under  the  nuU 
hypothesis. 

The  test  statistic  for  the  CA  CFAR  detector  is  the  average  of  the  voltages  from  aU  the  reference 
ceUs.  The  resulting  pdf  is  the  N-fold  convolution  of  the  pdf  of  a  single  ceU.  For  the  OS  CFAR 
processor  (based  on  a  single  ranked  sample),  the  test  statistic  is  the  kth  order  statistic  from  M 
reference  ceUs.  The  pdf  of  the  A:th  order  statistic  is  [4] 

Fk{u)  =  k  fo{u)Fo{u)'^-^[l  -  Foiu)]^-'^  (18) 

where  Fo{.)  and  /o(0  are  the  univariate  clutter-only  cumulative  distribution  function  (cdf)  and  pdf 
respectively  (under  the  nuU  hypothesis). 


3.2  WeibuU  Clutter 

The  WeibuU  model  has  been  suggested  for  sea  and  ground  clutter  magnitude  at  high  resolutions 
and  low  grazing  angles  [21,  22].  The  output  of  the  magnitude  detector  is  modeled  by  the  WeibuU 
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nj.G-\  r  /  \  C 

!x{^)  =  — ^g  -exp  -  j  U{x)  (19) 

where  B  is  the  scale  parameter,  C  is  the  shape  parameter,  and  U{x)  is  the  unit  step  function.  The 
scale  parameter  is  related  to  the  average  clutter  power,  Pq  by 

=  (20) 


where  r(.)  is  the  Eulerian  gamma  function.  The  Rayleigh  distribution  is  a  special  case  of  this 
two-parameter  distribution,  with  C  —  2.  Of  particular  interest  to  us  are  WeibuU  densities  with 
shape  parameter  values  between  1  and  2,  i.e.,  between  the  exponential  and  Rayleigh  densities.  If 
the  shape  parameter  is  known  a  priori,  there  is  a  simple  expression  for  the  adaptive  threshold  for 
the  OS  CFAR  processor  with  a  single  ranked  sample  [13].  Generally,  when  both  the  shape  and 
the  scale  parameter  are  unknown,  there  is  a  need  to  estimate  them  simultaneously.  Dubey  [6] 
and  Cohen  [3]  derived  estimators  for  the  WeibuU  parameters  based  on  rank-order  statistics  and  an 
iterative  solution  to  the  ML  equation,  respectively. 

The  ML  CFAR  [18]  uses  the  solution  of  the  iterative  ML  equation  to  estimate  the  shape  parameter 
and  obtains  the  scale  parameter  from  this  estimate.  The  foUowing  set  of  equations  are  used  to 
estimate  B  and  C  in  [3]  (the  estimates  are  denoted  as  B  and  C  respectively): 


Em=l  xglnXr, 

Z^m=l  •''m 


1  ^  1 
—  —  In  oTto  =  — 


/,  M 

(m  £ 

where  M  is  the  number  of  reference  ceUs  considered  for  estimation  and  Xm  is  the  output  of  the  mth 
reference  ceU.  The  resulting  adaptive  threshold.  Tea,  and  the  probability  of  false  alarm,  PpA-,  are 
related  by 


Pfa  =  exp  - 


Under  WeibuU  intensity  assumption  about  the  backscatter,  with  unknown  shape  and  scale  parame¬ 
ters,  [25]  describes  a  CFAR  algorithm,  based  on  two  ranked  samples  from  the  output  of  a  square-law 
detector.  The  adaptive  CFAR  threshold  is  given  by 

T  =  (24) 

where  yi  and  yj  (i  <  j)  are  the  rank-ordered  square-law  detected  outputs  from  the  background 
clutter  and  /3  is  a  function  of  i,  j,  and  the  Pfa-  Since  a  random  variable  generated  by  squaring 
a  WeibuU  distributed  random  variable,  is  itself  WeibuU  distributed  with  different  parameters,  and 
since  the  CFAR  threshold  is  independent  of  the  distribution  parameters,  (24)  is  also  appUcable  to 
the  threshold,  r,  corresponding  to  clutter  magnitudes  Xi  and  Xj  : 
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Dubey  [6]  derived  an  estimator,  C,  of  the  shape  parameter  based  on  two  ordered  samples  from  a 
set  of  WeibuU  magnitudes  : 

C  =  ln(l  -  hj)]  -  ln[-  ln(l  -  hj)] 

In  Xj  —  In  Xi 

where  hk  =  k/{M  +  1)  and  M  is  the  total  number  of  samples.  Dubey  showed  that  the  smallest 
variance  for  this  estimate  was  obtained  when  hi  =  0.1673  and  hj  —  0.9737.  The  corresponding 
values  of  i  and  j  are  used  in  (25). 

The  exact  integral  relation  between  (3  in  (25)  and  was  derived  in  [25]  using  the  joint  density 
of  the  ith  and  jth  ordered  samples,  and  is  given  by 


Pfa  = 


foo  roo 

— - — -r: .  - ry:  /  /  exp  I I  [1  -  exp(-u)]* 

-j)\Jy=oJu^o  L  1^ 

exp(— u)  [exp(— n)]^”-''*'^  [exp(— u)  —  exp(— ■y)]'^~*“^  U{v  —  u)dudv 


Levanon  and  Shor  [13]  gave  an  approximate  expression  for  (3  in  terms  of  the  parameter  a,-: 


In  [- ln(l  —  hj)]  -  In  [- ln(l  -  hi)]  ^ 

where  a,'  is  the  scale  factor  of  the  OS  CFAR  for  exponential  magnitude  [20],  and  is  related  to  the 
false  alarm  probability  by 

M!(a.  +  M  -  i)! 

(M  -  >)!(ai  +  M)\ 

The  discrepancy  between  the  actual  and  approximate  expressions  decreases  with  increasing  false 
alarm  probability.  The  approximate  expression  is  easier  to  handle  computationally  and  we  have 
used  it  in  our  experiments.  Figure  4  shows  an  HH  polarization  SAR  image  of  a  highway  over-pass. 
The  result  of  WeibuU  CFAR  processing  with  a  PpA  =  lO”^  appUed  to  this  image  are  shown  in 
Figures  5  and  6. 


3.3  K-Distributed  Clutter 

The  K  distribution  has  been  suggested  for  experimental  sea  clutter  [24]  and  ground  clutter  [27]. 
The  K  distribution  arises  when  a  complex  Gaussian  process  is  modulated  by  a  chi  distributed 
process.  Alternately,  it  can  be  shown  to  arise  from  modulating  the  power  of  a  Rayleigh  magnitude 
process  with  a  gamma  distributed  variable.  This  is  the  case  when  the  clutter  voltage  in  a  given  ceU 
exhibits  rapid  Rayleigh  fluctuations,  whose  mean  varies  slowly  over  time  according  to  the  gamma 
distribution.  The  two  parameter  K  pdf  is  given  by 

fx{x)  =  |^^(cx)‘'JW_i(2cx)  TJ{x)  (30) 

where  v  is  the  shape  parameter,  AV(-)  is  the  modified  Bessel  function  of  the  second  kind  of  order 
p,  and  c  is  a  power  parameter  related  to  the  mean  clutter  power,  Pq?  by  Po  =  When  u  =  oo, 
this  reduces  to  the  Rayleigh  pdf.  The  shape  parameter  controls  the  spikiness  of  the  clutter,  with 
lower  values  of  p  implying  more  spiky  clutter. 
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Figure  4:  Original  highway  overpass  image 


For  specific  values  of  the  shape  parameter,  namely  =  m  +  1/2,  m  =  0, 1, 2, . . there  exist  simple 
expressions  for  the  pdf  of  the  test  statistic  [1].  Since  large  values  of  v  result  in  the  Rayleigh 
amplitude  pdf,  and  small  values  give  rise  to  the  more  spiky  clutter  that  we  are  interested  in,  we 
wiU  restrict  our  interest  to  shape  parameter  values  of  0.5  and  1.5. 


In  general,  given  the  PDF,  pw{w),  of  the  test  statistic  W,  the  probability  of  false  alarm  is  given 
by  [1] 

/•CO  (  onp  'j 

Pfa  =  \jj^{TwyK,{2CTw)^pw{w)dw  (31) 

For  the  CA  CFAR  detector,  the  test  statistic  is  the  sum  of  M  correlated  reference  cell  outputs, 
each  of  which  is  a  K-distributed  random  variable.  Closed  form  expressions  (for  specific  values  of  u) 
for  pw{w)  are  given  in  [1].  The  resulting  PDF  for  u  =  3/2  is  reproduced  here  as  an  example 


\2M 


p^{w)  = 


-W 


T{2M) 

From  (18)  and  (30),  the  pdf  of  the  fcth  order  statistic  of  the  K  distribution  is 


fviy)  =  ‘^ck 


r  1 

l^icyyi 

U ) 

i  r(r^) 

K^-i{2cy)K^  ^{2cy) 


(32) 


(33) 


which  can  be  simplified  for  z/  =  m  +  1/2,to  =  0, 1,2, _ In  particular,  for  i/  =  1.5  we  have 


fY{y)  =  Ac‘^k(^]exp[-2c{M-k  +  l)y]{l  +  2cy)^  ^\l-{l  +  2cy)e 


k—1 


(34) 


Substituting  the  above  expressions  for  the  OS  CFAR  test  statistic  pdf  in  (31)  gives  the  relationships 
between  the  probability  of  false  alarm  and  the  adaptive  OS  CFAR  threshold  multiplier  for  if  =  1.5. 
In  general,  these  equations  have  to  be  solved  numerically  to  obtain  the  appropriate  threshold.  The 
result  of  CFAR  processing  with  a  Pfa  =  10“^  and  nominal  f  =  1.5  applied  to  the  over-pass  image 
of  Figure  4  are  shown  in  Figures  7  and  8. 


Figure  5:  WeibuU  ML  CFAR;  Pfa  =  Figure  6:  WeibuU  OS  CFAR;  Pfa  = 


Figure  7:  K  CA  CFAR;  f  =  1.5;  Pfa  =  10'^  Figure  8:  K  OS  CFAR;  f  =  1.5;  Pfa  =  10"^ 


4  Modified  Labeling  and  Site  Model  Construction 

As  mentioned  earlier,  purely  statistical  segmentation  schemes  do  not  perform  well  in  regions  con¬ 
taining  man-made  structures.  Several  observations  can  be  made  about  real  SAR  terrain  data.  At 
low  to  medium  depression  angles,  the  vertical  walls  of  buildings,  facing  the  sensor,  produce  small 
areas  of  backscatter.  Buildings  are,  in  general,  characterized  by  large  shadow  regions,  bounded  by 
bright  L-shaped  or  linear  streaks  (cardinal  effects)  toward  the  direction  of  the  sensor.  These  bright 
streaks  often  show  up  as  false  alarms  in  target  detection  algorithms.  Also,  even  when  supervised 
ML  segmentation  algorithms  (with  training  areas)  are  used,  there  is  a  tendency  to  misclassify  pixels 
as  parts  of  roads.  The  fact  that  roads  are  ribbon-shaped  strips  can  be  used  to  get  rid  of  most  of 
these  false  alarms.  Similarly,  the  fact  that  at  high  resolutions  and  low  to  medium  depression  angles, 
trees  produce  shadows  in  radar  images  can  be  used  to  reduce  misclassification  of  pixels  as  trees. 
Finally,  it  is  difficult  to  differentiate  between  shadow  regions  and  calm  bodies  of  water  in  SAR 
images,  because  they  produce  near-zero  backscatter  at  off-nadir  depression  angles.  Size-  ,  shape- 
or  context-based  arguments  need  to  be  used  to  effectively  resolve  shadows  and  calm  water. 

We  incorporate  these  rules  and  other  information  about  the  sensor  location,  heading  and  resolution 
to  build  an  approximate  2D  site  model  of  the  terrain  being  imaged.  This  site  model  is  constructed 
on  the  slant  range  plane.  The  complete  algorithm  for  the  detection  of  targets,  segmentation,  and 
building  of  a  two-dimensional  site  model  from  high  resolution,  polarimetric  SAR  data  is  given 
below.  Most  of  the  morphological  and  other  primitive  operations  mentioned  below  are  described 
in  greater  detail  in  [11]. 

•  Target  detection 

A  two-pass  Order  Statistic  CFAR  detector  is  used  for  target  detection.  Non-Rayleigh  OS 
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CFAR  detectors  are  better  suited  for  target  detection  in  high  resolution  spiky  clutter.  CFAR 
detection  is  initially  done  independently  on  the  three  polarimetric  channels,  with  a  low  false 
alarm  probabiUty,  typically  10“^.  The  resulting  binary  images  images  are  logically  OR-ed. 
This  conservative  approach  is  suitable  for  extracting  most  candidate  target  pixels  from  any  of 
the  polarization  channels.  Later,  a  second  CFAR  processing  step,  utilizing  a  higher  false  alarm 
probabiUty  (typicaUy  10“^),  is  performed  only  in  the  immediate  neighborhoods  of  previously 
detected  targets.  We  chose  a  5  X  5  neighborhood  around  each  target  pixel  detected  after  the 
first  pass.  This  two-stage  CFAR  processing  extracts  the  full  extent  of  large  targets,  while 
reducing  false  alarms. 

•  Filtering  and  streak  detection 

This  step  is  necessary  to  eUminate  isolated  false  alarms  and  as  a  pre-processing  routine  be¬ 
fore  building  detection.  Knowledge  about  the  radar  resolution  and  the  approximate  size  of 
potential  targets  can  be  used  to  group  target  pixels  into  clusters  and  get  rid  of  isolated  pixels. 
At  the  same  time,  if  the  minimum  cluster  size  is  set  too  large,  thin  streaks,  characteristic  of 
building  tops,  wiU  be  eUminated.  We  use  a  3  X  3  window,  which  is  moved  across  the  image, 
retaining  groups  of  at  least  four  pixels.  Thereafter,  a  Hough  transform  technique  is  used  to 
detect  dominant  Unear  streaks  formed  by  the  remaining  target  pixels.  A  relative  maximum 
operator  is  used  to  detect  aU  the  peaks  in  Hough  space,  and  the  corresponding  target  pixels 
are  passed  on  to  the  building  detection  stage. 

•  Supervised  ML  segmentation 

Training  areas  are  used  to  determine  the  polarimetric  covariance  matrix  parameters  of  possible 
terrain  classes.  We  train  on  nearly  homogeneous  patches  of  clearing,  grass,  road,  shadows 
and  trees,  obtained  from  elsewhere  in  the  data  sequence.  A  ML  estimate  of  the  region  label 
for  each  pixel  is  obtained  by  maximizing  the  joint  conditional  density  function  of  (6),  over 
a  local  (3  x  3)  neighborhood.  Target  pixels  detected  after  CFAR  processing  and  subsequent 
filtering,  are  not  considered  in  the  joint  distribution  characterization. 

•  Building  detection 

Shadows  in  high  resolution  SAR  terrain  images  are  caused  primarily  by  buildings  and  trees. 
Isolated  pixels  and  small  regions  classified  as  shadow  by  the  ML  segmentation  algorithm  are 
eUminated  by  a  morphological  erosion  operation  foUowed  by  a  dilation  operation.  The  re¬ 
maining  shadow  regions  are  grown  using  a  EucUdean  distance  criterion  on  neighboring  pixels. 
Those  pixels  which  border  a  shadow  region,  and  are  close  to  their  neighbors  (of  the  shadow 
class)  in  terms  of  the  EucUdean  distance  between  their  respective  intensity  measurement  vec¬ 
tors,  are  included  in  the  shadow  class.  This  step  is  appUed  repeatedly  until  there  are  no 
further  changes  in  the  pixel  labels.  FinaUy,  those  shadow  regions  which  are  bounded,  to  a 
large  extent,  by  a  streak  of  bright  target  pixels  extending  toward  the  sensor  are  declared  as 
building  shadows. 

•  Road  extraction 

SmaU  regions  are  eUminated  and  larger  ones  are  fiUed  out  using  a  series  of  binary  morpholog¬ 
ical  dilation/erosion  operations  on  the  road  regions  generated  by  the  ML  segmentation  step. 
Bounding  rectangles  of  the  appropriate  orientation  are  fitted  to  the  remaining  regions,  after 
a  connected  components  algorithm  is  used  to  label  each  region  uniquely.  Possible  ribbon¬ 
shaped  road  segments  are  detected  using  appropriate  thresholds  for  the  aspect  ratio,  length 
and  fractional  fiU  of  these  bounding  rectangles.  The  justification  for  this  step  is  that  road 
regions  are  expected  to  be  elongated,  nearly  rectangular  ribbons,  and  not  arbitrarily  shaped 
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blobs.  Detected  road  segments  are  extended  on  either  side  by  including  previously  detected 
road  regions  which  may  have  failed  the  above  tests. 

•  Modified  labeling  of  natural  features 

ML  segmentation  is  done  again  for  those  pixels  which  were  previously  misclassified  as  road 
or  shadow,  but  this  time  without  including  the  shadow  and  road  classes.  Shadow  regions 
and  bodies  of  water  can  be  resolved  at  this  stage  using  a  size  argument,  based  on  the  radar 
depression  angle  and  the  approximate  height  of  the  tree  canopy.  Verification  of  trees  is  done 
by  eliminating  small  tree  regions  and  looking  for  tree  shadows  (extending  away  from  the 
sensor)  associated  with  the  remaining  ones. 

•  Re-evaluating  target  clusters 

Target  streaks  corresponding  to  detected  buildings  are  eliminated  from  further  consideration 
as  possible  targets.  A  filtering  step  is  performed  to  eliminate  isolated  target  pixels  generated 
by  this  operation.  We  use  a  moving  window  of  size  5  X  5  to  retain  only  those  target  clusters 
which  contain  at  least  ten  pixels.  Bounding  rectangles  are  placed  around  the  remaining  target 
clusters  to  aid  in  target  recognition  and/or  facilitate  SAR  data  compression. 

The  steps  in  our  2-D  site  model  construction  algorithm  are  shown  in  Figures  9-10.  An  original 
SAR  image  with  moderate  urban  clutter  is  shown  in  Figure  9.  Figure  11  shows  the  possible  targets 
detected  after  a  two-pass  CFAR  detector,  followed  by  a  filtering  step  to  remove  isolated  target 
pixels.  This  result  was  obtained  using  an  OS  CFAR  detector  for  K  distributed  clutter,  with  shape 
parameter  v  =  1.5,  on  each  polarization  channel,  followed  by  a  logical  OR  operation.  The  desired 
false  alarm  probabilities  were  chosen  to  be  10“^  and  10“^  respectively  in  each  pass.  The  initial  ML 
segmentation  result,  with  a  complex  multivariate  Gaussian  assumption  for  the  backscatter  vector, 
is  shown  in  Figure  12.  The  target  pixels  detected  after  CFAR  processing  have  not  been  included 
in  the  segmentation  step.  At  this  stage  there  are  many  pixel  misclassifications,  especially  those 
which  are  classified  as  roads  and  trees.  Moreover,  no  buildings  have  been  detected  yet.  Figure  12 
shows  the  result  of  shadow  growing  followed  by  building  detection.  The  extracted  road  is  shown  in 
Figure  13,  while  those  tree  regions  which  are  supported  by  the  presence  of  tree  shadows  extending 
away  from  the  radar  are  shown  in  Figure  14.  Finally,  Figure  10  shows  the  complete  2D  site  model 
constructed  from  the  polarimetric  data.  This  figure  also  shows  possible  target  clusters,  surrounded 
by  rectangles,  which  could  be  used  in  radar  image  compression.  Figure  15  shows  a  larger,  wide-area 
SAR  image  and  the  result  of  our  site-model  building  algorithm  on  this  image  is  shown  in  16.  Once 
again,  the  final  segmentation  result  shows  white  rectangles  placed  around  possible  target  clusters. 


5  Summary  and  Conclusions 


We  have  investigated  some  of  the  key  issues  involved  in  the  analysis  of  SAR  images.  Our  research 
efforts  have  resulted  in  an  integrated  set  of  algorithms  that  can  produce  annoted  2-D  site  models 
of  high  resolution  SAR  images.  These  site  models  are  useful  in  context  based  exploitation  of  SAR 
images  and  ATR  applications. 

Future  research  issues  of  interest  are:  more  effective  utilization  of  phase  history  in  polarimetric 
data,  use  of  interferometric  data,  and  algorithms  for  registration  of  SAR  images  to  site  models 
for  automatically  delineating  regions  of  interest,  and  fusion  of  SAR  FLIR  and  IFSAR  data  for 
multisensor  exploitation  using  site  models. 
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Figure  9:  SAR  image  with  urban  clutter 


Figure  10:  Complete  2D  site  model 


Figure  13:  Extracted  road  Figure  14:  Verified  trees 


1 

Sliadov 

Bailding 

Rnaa 

Tree 

Ta^et 

14 


Figure  15:  Original  wide-area  SAR  image 
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